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Phenylalanine hydroxylase from Legionella pneumophila (IpPAH) has a major functional role in the 
synthesis of the pigment pyomelanin, which is a potential virulence factor. We present here the crystal 
structure of IpPAH, which is a dimeric enzyme that shows high thermostability, with a midpoint denat- 
uration temperature of 79 °C, and low substrate affinity. The structure revealed a dimerization motif 
that includes ionic interactions and a hydrophobic core, composed of both j3-structure and a C-terminal 
region, with the specific residues (P255, P256, Y257 and F258) interacting with the same residues from 
the adjacent subunit within the dimer. This unique dimerization interface, together with a number of 
aromatic clusters, appears to contribute to the high thermal stability of IpPAH. The crystal structure 
also explains the increased aggregation of the enzyme in the presence of salt. Moreover, the low affinity 
for substrate L-Phe could be explained from three consecutive glycine residues (G181, 182, 183) located 
at the substrate-binding site. This is the first structure of a dimeric bacterial PAH and provides a frame- 
work for interpreting the molecular and kinetic properties of IpPAH and for further investigating the 
regulation of the enzyme. 

© 2013 The Authors. Published by Elsevier B.V. on behalf of Federation of European Biochemical 
Societies. All rights reserved. 



1. Introduction 

The gram-negative bacterium Legionella pneumophila belongs to 
the y-proteobacteria. In nature, L pneumophila is an inhabitant of 
warm freshwater habitats where its multiplication is mainly re- 
stricted to intracellular niches inside amoebal hosts and after infec- 
tion it continues this intracellular life-style within the human host by 
multiplying inside macrophages [1 ]. Humans are a dead-end host for 
this pathogen, but it still causes outbreaks of Legionnaires' disease, 
a potentially fatal form of pneumonia, when it multiplies in warm, 
stagnant water that are spread in aerosols through human-made in- 
stallations such as fountains and showers [2-4]. In such aqueous en- 
vironments L pneumophila grows well at temperatures in the range 
20-48 °C, but its tolerance to higher temperatures can lead to re- 
growth after heat-disinfection [5]. The number of reported cases of 
Legionnaires' disease has increased in the last decade [6], making 
eradication of the bacterium from infection sources an important 
task. Thus, there is a need for research on possible targets, notably on 
putative virulence factors. 
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L pneumophila is one of several known pathogenic bacteria whose 
genomes encode a phenylalanine hydroxylase (PAH). PAH is the en- 
zyme that catalyzes the hydroxylation of L-phenylalanine (L-Phe) to l- 
tyrosine (L-Tyr) using non-heme Fe(II) and tetrahydrobiopterin (BH4) 
as cofactors, and dioxygen as additional substrate [7]. In mammals, 
PAH has a major catabolic role and its dysfunction is associated to 
deleterious hyperphenylalaninemia and phenylketonuria, whereas 
as we have recently shown [8] L pneumophila PAH (IpPAH) has an 
important role in the growth of the bacterium in media deficient in 
L-Tyr and in the synthesis of a brownish pigment called pyomelanin. 
Like some of the other bacteria that encode a PAH, L pneumophila 
produces and excretes pyomelanin when L-Phe or L-Tyr is available 
[9]. Pyomelanin is produced via the catabolism of L-Phe/L-Tyr when 
an intermediate of the pathway, homogentisate, accumulates and 
subsequently autooxidises and polymerises. In L pneumophila, the 
pyomelanin has ferric reductase properties, scavenging and reducing 
extracellular iron [10]. Iron is an essential metal for both human host 
and bacterial pathogens, among other as cofactor for many enzymes, 
including PAH. Iron is indeed critical for intracellular infection by 
L pneumophila [11] and production of pyomelanin is one of several 
strategies for iron assimilation [10]. Molecular and enzyme kinetic 
characterization has shown that IpPAH is well suited for catalysis 
at human body temperature, revealing a maximum activity around 
40 °C and very high specific activity compared to other eukaryote and 
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prokaryote PAHs [8,12-16]. The enzyme also presented high Km val- 
ues for substrate and cofactor BH4 (735 ± 50 m-M and 125 ± 25 [xM, 
respectively) [8], and the low affinity for L-Phe has been interpreted 
as a plausible regulatory mechanism to preserve threshold amounts 
of the substrate [7]. Similarly to all other PAHs, the activity of IpPAH 
is totally dependent upon a non-heme ferrous ion, but - in contrast to 
all other studied PAHs which are iron-bound tetramers (eukaryotes) 
or monomers (bacteria) [7,15,17] - IpPAH is isolated as an iron-free 
(apo) homodimer when expressed recombinantly [8]. The estimated 
hydrodynamic diameter of apo-lpPAH dimer (8.1 ± 0.1 nm) and its 
surprisingly high thermal stability (midpoint denaturation tempera- 
ture (Tm) = 79 ± 0.5 °C) is only slightly increased by Fe incorporation 
[8]. 

The high thermal stability of IpPAH might be important for preser- 
vation of the enzyme at temperatures where L pneumophila survives 
in a dormant, non-replicative state [18] and, based on a crucial role of 
the enzyme in the synthesis of pyomelanin, we set off to characterize 
the structure-function-stability relationships in dimeric IpPAH. The 
recombinant enzyme was crystallized and its structure has been an- 
alyzed and compared with that of PAHs from mesophilic Chromobac- 
terium violaceum (cvPAH; optimum temperature for growth, 25 °C) 
and psychrophilic Colwellia psychrerythraea 34H (cpPAH; optimum 
temperature for growth, 10 °C) bacteria, as well as with the human 
enzyme (hPAH; optimum temperature, 37 °C). The structural features 
identify a unique dimerization motif and aid to explain the particular 
structural determinants for thermal adaptation of IpPAH, as well as 
its low affinity for L-Phe and BH4 [8]. 

2. Results and discussion 

2. 1 . Refinement and overall structure 

The crystallization of IpPAH was challenging since the crystals 
grew as plates with some disorder found in the final models, and 
due to the low symmetry spacegroup P2i a decent rotation range 
of X-ray data was needed to obtain more that 90% completeness. 
The best crystal diffracted to 2.5 A (Table 1) and in the observed 
electron density maps the dimeric IpPAH structure is clearly defined, 
in particular for chains A and B. Thus, the current model gives good 
insight into the IpPAH structure as described below. 

The final IpPAH model refined to an i?-factor of 27.6% and an i^-free 
value of 30.2% (Table 1 ), which are slightly high but can be explained 
by the disorder found for chains C and D. The final IpPAH model is 
most complete for chains A and B, including residues Val8-Asp259 
(totally 252 residues for both chains), with lowest mean B-factors, 
whereas chains C (232 residues) and D (237 residues) have higher 
mean B-values and many disordered residues left out of the final 
model (Table 1, Fig. la). The observed differences between the chains 
can be explained from the symmetry contacts that chains A and B 
make with one another, whereas chains C and D are facing water 
channels with less symmetry-related protein-protein interactions. 
In the active site of IpPAH extra difference electron density was ob- 
served and interpreted as three polyethylene glycol (PEG; C4H10O3) 
molecules (chains A and B) (Fig. 2a). 

The catalytic domain of other non-heme iron- and BH4-dependent 
aromatic amino acid hydroxylases (AAAHs) has a mixed cx/(3 fold, 
also found for the reported IpPAH structure. The root mean square 
deviation (RMSD; www.ebi.ac.uk/msd-srv/ssm/) of IpPAH compared 
to other structures are: 1.2 A to cpPAH (both PDBs 2v27, 2v28 for 
252 CA-atoms), 1.3 A to cvPAH (both PDB ILTZ/ILTU for 233/232 
CA-atoms) and 1.5 A to hPAH-BH4 (PDB 1J8U for 193 CA-atoms). 

2.2. Dimerization and intersubunit interactions 

In agreement with the dimeric nature of IpPAH in solution [8] there 
are two dimers in the asymmetric unit, formed by either chains A and 



Table 1 

X-ray data collection and refinement statistics for IpPAH. 



X-ray statistics 



IpPAH 



PDB entry 
Beamline 
Space group 
Unit cell 



Resolution (A) 

(highest bin) 
Wavelength (A) 
No. of unique reflections 
Multiplicity 
Completeness (%) 
Mean (</>/<(Ti>) 
R-sym {%y 
Wilson B-factor (A^) 

Refinement 
Resolution (A) 
J?-factor (all reflections) {%y 
R-free {%y 
No. of atoms 
No. of water molecules 
No. of other molecules 
No of residues chain A/B/C/D 
R.m.s.d. bond lengths (A) 
R.m.s.d. bond angles (°) 
Average B-factor (A^) 
All atoms 

Protein (chain A/B/C/D) 
PEG/Water molecules 
Ramachandran plot: 

Most favored regions (%) 
Additionally allowed regions (%) 
Disallowed regions (%) 



4BPT 

Bessy, BL14.1 
P2i 

a = 90.32 A 
fa = 60.12 A 
c= 124.04 A 
^ = 94.07° 
25-2.5 
(2.64-2.5) 
0.91841 
41 892(6 267) 

2.3 (2.3) 
90.9 (94.0) 
8.6(2.2) 

8.4 (39.4) 
41.8 



10-2.5 

27.64 

30.20 

8156 

162 

6 PEG 

252/252/232/237 

0.018 

2.08 

47.2 

29.8/32.2/67.8/63.0 
26.2/47.4 



91.1 

6.4 

2.5 



° i^sym = (EhE/ I W - (/(/i))|)/(EhM/i)), where h{h) is the ith measurement of 
reflection h and </(/i)> is the weighted mean of all measurements of h. 



D or chains B and C (Fig. la). Both dimeric interfaces have similar 
size and the same residues are involved, thus only the AD interface 
will be described. Here there are strong ionic interactions (<4 A) 
involving Aspl98A-Arg247D, Asp228A-Lysl72D, Arg247A-Aspl98D 
and Lysl72A-Asp228D. There are two additional hydrogen bonds 
from A to D and D to A involving Arg202 0 to Leu204 N and one 
longer (3.6 A) polar interaction from Ilel99 0 to Arg212 NHl (Fig. 
lb). Totally there are 8/7 hydrogen bonds, 4/3 ion pairs <4 A and 
additional 3/3 ion pairs 4-6 A for the AD/BC dimers (Table 2). 

Upon dimer formation 21,100 A^, i.e. about 30% of the accessible 
surface area (ASA) for each monomer, is buried (Table 2). Compared to 
other functional dimers, this buried ASA is e.g. similar to that in isoc- 
itrate dehydrogenase from the psychrophilic bacterium Desulfotalea 
psychrophila [19] and larger than in alkaline phosphatase from the 
antarctic bacterium TABS and in three other alkaline phosphatases 
[20]. Significantly, the IpPAH dimer interface is very different from 
that of dimeric truncated hPAH (residues 103-428) [21 ], since in the 
latter the C-terminal residues (41 1-424 in hPAH) form two (3-strands 
involved in the dimerization (Fig. Ic). Also, while in hPAH (residues 
103-428) the dimer interface is at the (left) side of the catalytic do- 
main (Fig. Ic), the IpPAH dimer is formed at the top of chain A, with 
different residues involved in dimerization. In IpPAH, hydrogen bonds 
(Arg202 0 to Leu204 N) bind one |3-strand (residues 201-204) from 
chain A to the same strand from chain D, and these residues, together 
with ion-pair forming residues 172, 198, 247 and 228, are located to- 
wards the middle of the protein chain. This differs from the truncated 
hPAH dimer where the interactions only involve C-terminal residues 
(Fig. Ic) [21 ]. Furthermore, the buried ASA of 519 A^ (3.7% of ASA per 
monomer) measured in the hPAH dimer interface, is much smaller 
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Fig. 1. Overall structure of IpPAH. (a) The two IpPAH dimers (chains A-D) in the asymmetric unit, (b) Ionic interactions and (d) stacking interactions at the dimer interface of 
chains A (red) and D (sand), (c) Superimposition of hPAH truncated dimer (subunits in yellow and cyan; PDB IPAH [21]) onto the IpPAH dimer (chains A and D in red and sand, 
respectively), (e) A large aromatic cluster, with the unique IpPAH residues Phe211, Phe236 and Phe244. (f) Residues 163-168 and 225-229 (motifs in green) predicted to be prone 
to (3-aggregation by the TANGO-algorithm, surrounded by ion pairs and aromatic clusters [38] in one IpPAH dimer. (For interpretation of the references to colour in this figure 
legend, the reader is referred to the web version of this article.) 
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Table 2 

Structure and dimer interaction analysis of IpPAH. The calculations are done for the 
residues in the crystal structure only, as indicated. 

IpPAH 



Fig. 2. The IpPAH active site, (a) The active site of apo-lpPAH with three polyethylene 
glycol (PEG) molecules and one water molecule (WAT, in red), (b) IpPAH with modeled 
BH2 (green), and (c) with substrate analogue THA, BH2 and Fe2+ (green) modeled from 
hPAH-BH4-THA (PDB IMMK). (For interpretation of the references to colour in this 
figure legend, the reader is referred to the web version of this article.) 



than in the IpPAH dimer. 

In addition to the ionic interactions, which appear to seal the IpPAH 
dimer together, the interacting interface is composed of the (3-strands 



No. of res. in chain A/B/C/D 
No. of H-bonds per residue in** 

Chain A 

Chain B 

Chain C 

Chain D 
Ion pairs 

No. of <4 A Chain A/B/C/D 

No. of 4-6 A Chain A/B/C/D 
No. of ion pairs per residue^ 

<4 A Chain A/B/C/D 
Dimeric interface 
No. ofH-bonds^ 

Chain A to D 

Chain B to C 
No. ion pairs <4/6 A 

Chain A to D 

Chain B to C 
Accessible surface area (ASA) of dimer (A^ 

Chain A and D 

Chain B and C 
Buried intersubunit surface (% of dimer) 

Chain A to D 

Chain B to C 



252/252/232/237 

0.881 
0.865 
0.853 
0.844 

13/10/4/8 
10/12/5/6 

0.052/0.040/0.017/0.034 



4/2 
3/3 

I 

21,150 
21,065 

30.2 
30.4 



° In crystal structure. 

^ Including ionic interactions. 



Val201-Arg202-Ile203-Leu204-Phe205 and Val223-Tyr224-Phe225- 
Val226-Ile227 from each subunit, and the adjacent hydrophobic C- 
terminal coils (Glu251 -Asp259) where the two Tyr257 residues estab- 
lish an important stacking interaction (Fig. Id). The unique dimeric ar- 
rangement in IpPAH, and notably the presence of these hydrophobic/ 
aromatic interactions, seems to be an important determinant of the 
high thermal stability of the enzyme [8] (see below). 

2.3. Iron dependence 

Even when excess iron is added during protein expression, recom- 
binant IpPAH is purified from Escherichia coU as an apo (metal-free) 
enzyme (0.07 ± 0.03 mol Fe(II)/mol subunit) [8]. Also, when activ- 
ity of the native enzyme is measured in lysates of L. pneumophila, 
addition of 100 |j.M Fe(II) in the assay gives several-fold higher activ- 
ity [8], indicating that the active site of IpPAH is sub-saturated with 
catalytic iron intracellularly. A comparison of LpPAH to cpPAH (PDB 
2v27), cvPAH (PDB ILTV) and hPAH-BH4 (PDB 1J8U), identifies the 
iron-binding residues in IpPAH as Hisl22, Hisl27 and Glul67. In Ip- 
PAH (chain A) a water molecule is found at the similar position as 
Fe(II) in other PAHs in the metal-bound holo-form (Fig. 2a). From the 
observed electron density it is clearly a solvent molecule and not a 
metal ion, in agreement with IpPAH being isolated as an apoenzyme 
[8]. 

2.4. The BH4 cofactor and substrate binding sites 

Available structures of hPAH complexed with BH4 (or BH2) ei- 
ther alone or together with the THA substrate analogue ((hPAH-BH4 
(PDB 1J8U); hPAH-BH4-THA (PDB IMMK) and CVPAH-BH2 (PDB ILTZ)) 
provide a frame for comparative functional analyses. IpPAH shows a 
/Cm(BH4) of 125 ± 25 |j.M, which indicates lower affinity for the co- 
factor compared to hPAH (/<:m(BH4) = 15-29 jxM) [22] and cvPAH 
(/<:m(BH4) = 15-21 |j,M) [16,23]. One structural explanation for the 
lower affinity is that IpPAH cannot provide the same number of hydro- 
gen bonds upon complex formation with BH4 as cvPAH (from Aspl 04) 
and hPAH (from Ser251 ) due to Ala88 (Fig. 2b). The side chain of Ala88 
in IpPAH is hydrophobic and cannot make a hydrogen bond to 0-1' 
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of BH2/BH4. Still, IpPAH should be able to make two hydrogen bonds 
from both Ile86 0 and N as observed in hPAH-BH4 and CVPAH-BH2, 
as shown in Fig. 2b where BH2 is modeled. IpPAH has nevertheless 
higher affinity for BH4 than cpPAH (/<:m(BH4) = 300 ^M) [15] where 
Phe88 (AlaSS in IpPAH) might sterically hamper the cofactor binding, 
thus explaining the poor binding of BH4 to the cold-active cpPAH. 

The enzyme kinetics of IpPAH also indicated a low affinity for its 
substrate, with a /Cm(L-Phe) = 735 ± 50 |j.M [8] compared with a 
/<:m(L-Phe) = 59 ± 10 i^M for cvPAH [16] and /<:m(L-Phe) = 60 ^iM 
for truncated hPAH (residues 103-428) [24]. We studied the struc- 
ture for possible residue determinants of the low affinity for L-Phe 
in IpPAH. The substrate binding-site is best characterized in the 
crystal structure of truncated dimeric hPAH bound to two substrate 
analogs L-norleucine (PDB IMMT) and 3-(2-thienyl)-L-alanine (THA; 
PDB IMMK) [25]. When modeling THA into IpPAH the conserved 
Argl07 interacts with the carboxyl group of L-Phe (Fig. 2c). On the 
other hand, Tyrll4 in IpPAH, which is conserved in all PAHs, is not 
in the right orientation to contribute in a similar way as in hPAH 
to the hydrophobic binding site [25], although rotation into another 
side-chain rotamer might improve the L-Phe binding properties (Fig. 
2c). Other adjacent conserved residues that promote L-Phe binding 
are Proll8, Hisl22, Trpl63 and Phel68 where the two latter are 
found opposite to Proll8. hPAH has a proline residue where IpPAH 
has Glnll6. There are three consecutive glycines (Glyl81, Glyl82, 
Glyl83) in IpPAH corresponding to Gly, Ala and Gly in hPAH and in 
cvPAH. These glycines are also found in cold-active cpPAH which has 
low substrate affinity ([S0.5](L-Phe) = 1.1 ± 0.1 mM) [15]. Assum- 
ing that the current orientation is maintained in L-Phe-bound IpPAH, 
the three consecutive glycines (Glyl81, Glyl82, Glyl83) and Tyrll4, 
might explain the observed relatively low affinity of this enzyme for 
L-Phe binding. 

2.5. Thermal stability of IpPAH 

Characterization of the thermal stability of iron-free apo-lpPAH 
both by circular dichroism and differential scanning calorimetry (DSC) 
has revealed a thermostable enzyme with Tm = 79 ± 0.5 °C [8], which 
is much higher than the Tm-values measured for PAH from eukary- 
otes or other mesophilic bacteria studied so far, clustering around 
55 °C [9,26,27]. In fact, IpPAH shows a higher Tm-value than the apo 
form of PAH from the thermophile Chloroflexus aurantiacus (caPAH; 
Tm ~ 64 °C) [12]. Moreover, similar Tm-values are measured for the 
Fe(II)-bound holoenzymes of both IpPAH {Tm - 80 °C) and ther- 
mophilic caPAH since the binding of Fe(II) seems to be a large stabi- 
lization determinant for caPAH [12] whereas it has almost no effect 
on the stability of IpPAH [8]. Thus, it seems that specific structural 
determinants are related to the intrinsic stability of apo-lpPAH, and 
we notably focused on specific clusters of ionic residues, hydrophobic 
residues including aromatic clusters, and dimerization. 

2.6. Polar interactions and amino acid sequence content 

As hydrogen bonds and ion-pair content appear to be related to 
protein thermostability and thermal adaptation, see e.g. [28,29], we 
analyzed these interactions in IpPAH (Table 2) and compared them 
with those in cpPAH, cvPAH and hPAH [15]. The IpPAH amino acid se- 
quence has 45%, 38% and 19% sequence identity to cpPAH (Tm = 52 °C) 
[15], cvPAH (Tm = 64 °C) [27] and hPAH (Tm = 53 °C) [15], respec- 
tively (Fig. SI; Supplemental information). IpPAH has 0.84-0.88 hy- 
drogen bonds per residue (Table 2). Comparatively, cpPAH was found 
to have a higher number of hydrogen bonds per residue (0.981 for 
holo-cpPAH, PDB 2V27, and 0.951 for apo-cpPAH, PDB 2V28) [15]. 
It is however important to keep in mind that these numbers are 
dependent on resolution of the X-ray structure and refinement re- 
strains. Furthermore, for many thermophilic enzymes, extended ion- 
pair clusters are shown to explain their high thermostability (see e.g. 



Table 3 

Residue content analysis, (a) Residues in an aromatic cluster and (b) number of Phe (F), 
Tyr (Y), Trp (W) and His (H) residues in IpPAH, cpPAH, cvPAH and truncated hPAH. 



Residue No. IpPAH cpPAH cvPAH hPAH 



(a) 










91 


F 


F 


F 


F 


92 


F 


F 


F 


L 


100 


F 


F 


F 


F 


162 


F 


Y 


Y 


Y 


139 


F 


Y 


Y 


Y 


143 


Y 


Y 


Y 


I 


205 


F 


F 


F 


L 


211 


F 


L 


M 


A 


236 


F 


I 


A 




244 


F 


L 


L 




(207) 


(P) 


(I) 


(L) 


(L) 


No. of 


10 


7 


7 


4 


aromatic 










residues in 










cluster 










(b) 












ipPAH 


cpPAH 


cvPAH 


hPAH 
(103-428) 


In Gene (no. 


272 


267 


297 


326 


aa) 










No. ofF 


20 


16 


17 


22 


No. of Y 


13 


12 


12 


20 


No. of W 


4 


4 


7 


3 


No. ofH 


4 


5 


6 


10 


Total No. 


41 (15.1%) 


37(13.9%) 


41 (14.1%) 


55(16.9%) 


(%)ofF, Y, 










W,H 











[28,29]). However, for IpPAH both the total number and the number of 
ion-pairs per defined residue are low (0.017-0.052; Table 2) and the 
largest ion-pair cluster (<4 A) comprises only three residues. Thus, 
from the current IpPAH structure the hydrogen bonds and salt-pairs 
do not seem to explain the high thermotolerance of this enzyme. 

But when analyzing the amino acid sequence, the high Arg/ 
(Arg + Lys) ratio (0.67/0.39/0.61/0.44 in Ip/cp/cv/hPAH), the low 
number of Gly residues (10/13/14/15 in Ip/cp/cv/hPAH) and the Pro 
content (14/12/14/12 in Ip/cp/cv/hPAH), might all contribute to the 
thermostability in IpPAH compared to other PAHs, as inferred from 
comparative analyses of thermophilic proteins versus mesophilic and 
psychrophilic orthologs aiming to reveal mechanisms of temperature 
adaptation [15,30]. While arginine residues appear to be superior to 
lysines to withstand high temperatures, fewer glycines and additional 
prolines are associated to a reduction in conformational flexibility 
[30-33]. 

2.7. Aromatic interactions and the unique dimerization interface 

One striking feature in IpPAH is an extended aromatic cluster pro- 
truding from the conserved active site residues Phe91, Phe92 (Leu in 
hPAH), PhelOO, Phel39 (Tyr in cp, cv, hPAH) and Phel62. In IpPAH the 
cluster is extended by residues Tyrl43 (Tyr in cp, cvPAH), Phe205 (Phe 
in cp, cvPAH), Phe211, Phe236 and Phe244, and enclosed by Pro207 
(Fig. le). In total there are 10/7/7/4 aromatic residues in Ip/cp/cv/ 
hPAH in this aromatic cluster found in one single IpPAH chain (Table 
3). 

There is another aromatic cluster in IpPAH (Trp28, Phe32, Phe47, 
Phel21, Phel25) lined by Pro62 and Prol29 which is similar in the 
compared PAHs except hPAH which has one Cys (Phel23 in IpPAH). 
Structures of the other aromatic amino acid hydroxylase family mem- 
bers tryptophan hydroxylase (PDB IMLW, 3E2T, 3HF6) and tyrosine 
hydroxylase (PDB ITOH, 2T0H, 2XSN) also contain a high number 
of aromatic residues (Phe, Tyr, Trp, His; data not shown). For the 
compared PAH structures the number of phenylalanine residues is 
relatively high in IpPAH (20/16/17/22 in Ip/cp/cv/hPAH), and many 
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Table 4 

Structure-energetics correlations. Tlie theoretical unfolding changes in heat capacity 
(ACp) and enthalpy at 60 °C (AHeo), and at 79 °C (the Tm-value for IpPAH) (AH79), 
calculated from the changes in apolar and polar accessible surface area (AASAap and 
AASAp) based on the crystal structure of dimeric IpPAH for the A and D or B and C 
chains, respectively, that form two unique dimers. 





AASAap 
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8125.9 
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145.9 


2161^ 
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° The AH79, calculated from AHeo and ACp using the Kirchhoff equation. 

^ For comparison, the experimental, calorimetric enthalpy change (AH), calculated 

from the DSC scan in the absence of salt (Fig. 3a and Flydal et. al. [8], was 169.9 ± 0.2 

kcal/mol. 



of the additional aromatic residues are in the described clusters. For 
hPAH there are 326 residues in the characterized dimeric construct 
(Glyl03-Gln428) and the Phe content is lower (6.7%) than in IpPAH 
(7.4%) (Table 3). In general, aromatic residues can form stacking pi-pi 
interactions both in parallel and perpendicular fashion when their 
ring centers are closer than 7 A [34,35]. Mutation of one central 
aromatic residue in isocitrate dehydrogenase from the thermostable 
Thermotoga Maritima decreased the melting temperature by 3.5 °C 
[28]. Thus for IpPAH these two clusters probably contribute to its 
high thermal stability. 

Furthermore, a third relevant aromatic interaction is formed by 
Tyr257 in one subunit stacking with the same residue in the adjacent 
subunit in the dimer. The two Tyr257 residues are held in correct 
orientation by Pro255 and Pro256 (unique to IpPAH; Fig. SI) and 
face the corresponding residues in the other chain (Fig. Id). Phe258, 
which follows Pro255, Pro256 and Tyr257, is also specific to IpPAH 
and contributes to the aromatic/hydrophobic character of the most 
C-terminal part of the dimerization area. This aromatic cluster is thus 
essential to the formation of the dimerization interface in the enzyme 
from L pneumophila. This interface is unique among the aromatic 
amino acid hydroxylases reported to date, and covers a larger area 
and implies a much larger number of intersubunit interactions than 
the dimerization motif in the mammalian enzymes (Fig. Ic and see 
above). Furthermore, IpPAH is the only reported dimeric bacterial 
PAH, which strongly points to oligomerization as a mechanism for 
increasing the thermal stability in the bacterial scaffold, as suggested 
for other proteins (for a review see [31 ]). 

2.8. Effect of salt on the thermal denaturation 

In order to obtain thermodynamic insights on PAH from L pneu- 
mophila, and on the structure-energetics correlations, we investigated 
the thermal denaturation of the enzyme both in the absence and the 
presence of salt. As we have previously shown by differential scanning 
calorimetry (DSC), the unfolding of IpPAH at pH 7.0 in the absence of 
NaCl provides an endothermic transition with Tm = 79 ± 0.5 °C with 
a calorimetric enthalpy change (AH) = 169.9 ± 0.2 kcal/mol ([8] and 
Fig. 3a). The theoretical prediction of AH at the Tm (AH79) obtained 
by energy calculations using the crystallographic structure provides a 
higher enthalpy change {AHyg = 216/218 kcal/mol for chains AD/BC; 
Table 4), in agreement with the irreversible thermal denaturation of 
IpPAH and a partially structured denatured state. Actually, CD analy- 
sis at 100 °C have shown an increased negative ellipticity at 216 nm 
[8], indicating that the remaining structure in the denatured state in- 
cludes (3-structure, which might result from intersubunit formation 
of cross-|3 interactions [36]. 

When the thermal denaturation takes place in the presence of 
200 mM NaCl, macroscopic aggregation is clearly observed at the 
end of the thermal transition, which is in addition less endothermic 
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Fig. 3. Effect of salt on the thermal denaturation and aggregation of IpPAH. Thermal 
denaturation was monitored by (a) DSC and (b) DLS. IpPAH (30 \lM subunit) was heated 
in 20 mM Na-Hepes, pH 7.0 (black lines) and in the same buffer with 200 mM NaCl (grey 
lines). The scan rate was 1 K/min in DSC measurements (a) and the average size of the 
particles (Z-average) was estimated by scattering intensity measurements monitored 
every 3 °C after an equilibration time of 1 min (b). 



and very distorted by the aggregation associated exothermic process, 
hindering the determination of the Tm and AH at those conditions (Fig. 
3a). Still, the apparent Tm with salt is lower than without (Fig. 3a). This 
stimulating effect of salt on aggregation might be associated to the 
special dimerization area, including high content of (3-structure and 
intersubunit hydrophobic interactions surrounded by several ionic 
pairs (Fig. If). Hence, the presence of salt could initially favor the 
separation of the intersubunit ionic bonds and then reinforce the 
attractive hydrophobic interactions and enhance aggregation, as also 
inferred in studies of aggregation-prone proteins in different solvents 
including salt-free water (see e.g. [37]). 

Further insights on the salt effect on the aggregation propensity 
and type of thermal-induced aggregation of IpPAH were obtained by 
thermal dependent dynamic light scattering (DLS). As shown in Fig. 
3b, the onset of aggregation is retarded when the enzyme is heated 
in buffer without salt, and the size of the aggregates are > 100-fold 
larger, consistent with the aggregates in salt being visible to the naked 
eye. 

The special architecture of IpPAH at the intersubunit region, with 
a central core of (3-strands establishing intermolecular hydrophobic 
interactions and surrounded by ionic pairs, indicates that the central 
area might be most prone to aggregate in a (3-type of interaction. We 
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analyzed the aggregation propensity of the protein using the TANGO 
algorithm [38], which identified residues 163-168 and 225-229 as 
those having the highest propensity to denaturation. While residues 
225-229 indeed comprise the (3-strand at the intersubunit interface 
in the dimer (Fig. If), residues 163-168 are located on the last turn 
of an cx-helix consecutive to the former strand, and well oriented to 
form a cross- (3-aggregation upon a local conformational change in 
the helix. Serpins and prion proteins are prototype of this kind of 
aggregation [39], but it has also been found in other proteins that do 
not form amyloid fibers upon aggregation [36]. 

2.9. Physiological relevance of high thermal stability 

It is not known whether thermal stability extends to other pro- 
teins from L. pnemophila. To our knowledge, IpPAH is the only L. 
pneumophila protein for which the denaturation temperature has 
been determined. Although it is not obvious why an enzyme from 
L. pneumophila would withstand temperatures well above normal 
growth-conditions, it is interesting that L pneumophila is able to en- 
ter a dormant, viable but non-culturable (VBNC) state in response to 
low nutrient-availability and other environmental stress conditions 
[18]. These bacteria can be resuscitated upon entering a permissive 
amoebal host and continue both growth and pathogenicity [40,41 ]. It 
will be interesting to investigate if high thermostability is a general 
trait of L pneumophila proteins or if it is a special property of some 
few, including PAH. In this later case it is tempting to speculate if 
IpPAH could be preserved during VBNC due to a beneficial role upon 
resuscitation. 

2.10. Conclusion 

The presented crystal structure of thermostable phenylalanine 
hydroxylase from Legionella pneumophila (IpPAH) showed a dimeric 
structure with the unique Tyr257 interacting with the same residue 
in the adjacent subunit, stabilized by the adjacent Pro255, Pro256 
and Phe258 residues. The unique dimeric interface include ion pairs 
and aromatic interactions, and two additional aromatic clusters per 
monomer are the plausible structural determinants of the thermal 
stability of IpPAH. In the presence of salt (200 mM NaCl), macroscopic 
protein aggregation was observed, compatible with an effect of salt 
both destabilizing the ion-pairs at the dimer interface and reinforcing 
the attractive hydrophobic interactions. 

3. Material and methods 

3. 1 . Overexpression and purification of IpPAH 

Recombinant wild-type (wt) IpPAH was overexpressed in E. coU 
strain BL21-Codon Plus(DE3)RIL as fusion protein (His)6-ZZ-lpPAH 
with a Tobacco Etch Virus (TEV)-cutting site between the His-tag 
and ZZ carrier as described [8]. The (His)6-ZZ-lpPAH fusion protein 
was cut with (His)6-tagged TEV protease overnight at 4 °C and the 
isolated IpPAH was obtained by collecting the flow-through from a 
TALON column, followed by buffer exchange in a desalting (PD-10) 
column into 20 mM Hepes, pH 7.0 and 200 mM NaCl. 

The protein concentration of IpPAH was measured spectropho- 
tometrically using the extinction coefficient £280 = 1-20 (mg/ 
mL)~^ cm~^ The purified protein was stored in liquid nitrogen. 

3.2. Crystallization, structure solution and refinement 

Crystallization experiments on IpPAH were performed with a 
Phoenix crystallization robot (Art Robbins Instruments), at room tem- 
perature in 96-well format with MRC plates. Volumes used were 60 
|il reservoir solution per well, and drops from 0.5 |xl well solution 
plus 0.5 |j.l protein solution with 15-21 mg/ml protein in sitting 



drop experiments. Initially 480 different in-house made conditions 
were screened followed by optimization around several successful 
conditions. Still, addition of glycerol, sugars, low molecular weight 
polyethylene glycol (PEG) and change in pH and precipitant concen- 
tration, only improved the crystal quality slightly. The largest crystals 
grown had a protein concentration of 15-17 mg/ml and with reser- 
voir solutions containing 1.6 M NaK2P04 and 4% (v/v) PEG 400. The 
crystals were cryo protected in 1.6 M NaK2P04 and 27% etylene glycol, 
and then flash frozen in liquid nitrogen. 

X-ray data for IpPAH was collected at Bessy, BL 14.1 at 100 K with 
a wavelength of 0.91841 A, 6 s exposure, 0.25° oscillation per frame, 
and in total 103° of data was used in the final data set. The data was 
integrated and scaled with the program XDS [42] and structure factors 
were obtained using TRUNCATE [43]. The IpPAH structure was solved 
by molecular replacement (MR), with a homology model made from 
Colwelliapsychrerythraea 34H PAH (cpPAH; PDB 2V27; 45% sequence 
identity) [15] as search model with the program PHASER [44]. Three 
molecules were first identified from the MR solution and inspection 
of the electron density identified the fourth molecule that was further 
down in the MR solution list. 

The structure was refined with REFMAC5 [45 ] and manually rebuilt 
in WinCoot [46]. Translation Libration Screw-motion (TLS) refinement 
was utilized due to disorder in some of the chains. Phenix was also 
tried as refinement program but it did not improve the statistics nor 
the electron density maps. 

In this paper we have compared our IpPAH structure to cpPAH 
(PDB 2V27) [15], CVPAH-BH2 (PDB ILTZ) [47], hPAH-Fe(II)-BH4 (PDB 
1J8U) [25], hPAH-BH4-thienylalanine (THA) (PDB IMMK) [48] and 
hPAH (PDB 1 PAH) [21 ]. hPAH in these structures is a dimeric truncated 
form including residues 103-428. 

3.3. Differential scanning calorimetry (DSC) 

DSC was performed using a MicroCal VP-DSC microcalorimeter 
(GE Healthcare). A sample of 30 fxM subunit IpPAH in degassed 20 mM 
Na-Hepes, pH 7.0, with or without 200 mM NaCl, as indicated, was 
heated from 37 to 90 °C using a scan rate of 1 K/min. A buffer-buffer 
reference trace was subtracted and the data was normalized with 
respect to concentration to obtain excess heat capacity (Cp) as a func- 
tion of temperature using the MicroCal-enabled Origin 7.0. software. 
These curves were then analyzed to determine the midpoint denat- 
uration temperature (Tm) and the calorimetric enthalpy change (AH) 
for the unfolding transitions. 

3.4. Dynamic light scattering (DLS ) 

Thermally induced aggregation of IpPAH was measured by DLS 
using a Nanosizer S (Malvern Instruments, Sweden) equipped with a 
He-Ne laser (633 nm) and a fixed 173° back scattering angle. 30 m-M 
subunit IpPAH in 20 mM Na-Hepes, pH 7.0, with or without 200 mM 
NaCl, was heated from 37 to 90 °C. The average size of the particles 
(Z-average) was estimated by scattering intensity measurements and 
monitored as a function of temperature with measurements every 
3 °C after an equilibration time of 60 s. 

3.5. Structure-based theoretical unfolding enthalpy values and analysis 
of aggregation propensity 

Calculation of the theoretical unfolding heat capacity change 
(ACp) and enthalpy change at 60 °C (AHeo) was performed using 
the structure-energetics relationships developed by Freire and co- 
workers [49], as explained [26], based on the apolar and polar ac- 
cessible surface area (AASAap and AASAp, respectively). AASAap and 
AASAp were calculated by Getarea [50] using the crystal structure of 
dimeric IpPAH (either chains A and D or chains B and C which form 
two unique dimers in the asymmetric unit). 
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The TANGO algorithm (http://tango.crg.es/) [38] was used to pre- 
dict regions involved in (3-type aggregation in dimeric IpPAH. 

Protein structure Accession numbers 

Coordinates and structure factors for the IpPAH structure have 
been deposited in the PDB with accession code 4BPT. 
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